Evolutionary Online Data Mining: An Investigation in a Dynamic Environment
نویسندگان
چکیده
Recently, traditional data mining algorithms are challenged by two problems: streaming data, and changes in the hidden context. These challenges emerged from real-world applications such as network intrusion detection, credit card fraud detection, etc. Online or incremental learning becomes more important than ever for dealing with these problems. This chapter investigates XCS, an evolutionary learning classifier system, that offers an incremental learning ability and also is able to handle an infinite amount of continuously arriving data. XCS has been tested on many data mining problems and demonstrated as a potential online data mining approach. Most experiments with XCS assume a static environment. Since environments are more likely to be dynamic in real life, noise and environmental factors need to be taken into account in a good data mining approach. This chapter investigates XCS in dynamic environments, in the presence of noise in the training data. An essential requirement of an algorithm in dynamic environments is to be able to recover quickly from hidden changes, while reusing previous knowledge. Our results show that XCS is capable of recovering quickly from small changes in the underlying concepts. However, it requires significant time to re-learn a model after severe changes. We propose several strategies to force the system to learn quickly after severe changes. There are adaptive learning; re-initializing the parameters; and re-initializing the population. Experiments show improvement in the predictive performance of XCS, when compared to the traditional XCS.
منابع مشابه
Evolutionary Computing Assisted Wireless Sensor Network Mining for QoS-Centric and Energy-efficient Routing Protocol
The exponential rise in wireless communication demands and allied applications have revitalized academia-industries to develop more efficient routing protocols. Wireless Sensor Network (WSN) being battery operated network, it often undergoes node death-causing pre-ma...
متن کاملMulti-layer Clustering Topology Design in Densely Deployed Wireless Sensor Network using Evolutionary Algorithms
Due to the resource constraint and dynamic parameters, reducing energy consumption became the most important issues of wireless sensor networks topology design. All proposed hierarchy methods cluster a WSN in different cluster layers in one step of evolutionary algorithm usage with complicated parameters which may lead to reducing efficiency and performance. In fact, in WSNs topology, increasin...
متن کاملData Mining Meets Evolutionary Computation: A New Framework for Dynamic and Scalable Evolutionary Data Mining based on Non-Stationary Function Optimization
Data mining has recently attracted attention as a set of efficient techniques that can discover patterns from huge data. More recent advancements in collecting massive evolving data streams created a crucial need for dynamic data mining. In this paper, we present a genetic algorithm based on a new representation mechanism that allows several phenotypes to be simultaneously expressed to differen...
متن کاملDynamic and Scalable Evolutionary Data Mining: An Approach Based on a Self-Adaptive Multiple Expression Mechanism
Data mining has recently attracted attention as a set of efficient techniques that can discover patterns from huge data. More recent advancements in collecting massive evolving data streams created a crucial need for dynamic data mining. In this paper, we present a genetic algorithm based on a new representation mechanism, that allows several phenotypes to be simultaneously expressed to differe...
متن کاملCustomer lifetime value model in an online toy store
Business all around the world uses different approaches to know their customers, segment them and formulate suitable strategies for them. One of these approaches is calculating the value of each customer for the company. In this paper by calculating Customer Lifetime Value (CLV) for individual customers of an online toy store named Alakdolak, three customer segments are extracted. The level of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007